Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 49
Filtrar
1.
Nat Commun ; 15(1): 2359, 2024 Mar 19.
Artigo em Inglês | MEDLINE | ID: mdl-38504097

RESUMO

Genetic mechanisms of blood pressure (BP) regulation remain poorly defined. Using kidney-specific epigenomic annotations and 3D genome information we generated and validated gene expression prediction models for the purpose of transcriptome-wide association studies in 700 human kidneys. We identified 889 kidney genes associated with BP of which 399 were prioritised as contributors to BP regulation. Imputation of kidney proteome and microRNAome uncovered 97 renal proteins and 11 miRNAs associated with BP. Integration with plasma proteomics and metabolomics illuminated circulating levels of myo-inositol, 4-guanidinobutanoate and angiotensinogen as downstream effectors of several kidney BP genes (SLC5A11, AGMAT, AGT, respectively). We showed that genetically determined reduction in renal expression may mimic the effects of rare loss-of-function variants on kidney mRNA/protein and lead to an increase in BP (e.g., ENPEP). We demonstrated a strong correlation (r = 0.81) in expression of protein-coding genes between cells harvested from urine and the kidney highlighting a diagnostic potential of urinary cell transcriptomics. We uncovered adenylyl cyclase activators as a repurposing opportunity for hypertension and illustrated examples of BP-elevating effects of anticancer drugs (e.g. tubulin polymerisation inhibitors). Collectively, our studies provide new biological insights into genetic regulation of BP with potential to drive clinical translation in hypertension.


Assuntos
Hipertensão , Proteoma , Humanos , Pressão Sanguínea/genética , Proteoma/genética , Proteoma/metabolismo , Transcriptoma/genética , Multiômica , Hipertensão/metabolismo , Rim/metabolismo , Proteínas de Transporte de Sódio-Glucose/genética , Proteínas de Transporte de Sódio-Glucose/metabolismo
2.
Commun Med (Lond) ; 3(1): 76, 2023 May 27.
Artigo em Inglês | MEDLINE | ID: mdl-37244961

RESUMO

BACKGROUND: Previous studies have demonstrated epidemiological trends in individual metastatic cancer subtypes; however, research forecasting long-term incidence trends and projected survivorship of metastatic cancers is lacking. We assess the burden of metastatic cancer to 2040 by (1) characterizing past, current, and forecasted incidence trends, and (2) estimating odds of long-term (5-year) survivorship. METHODS: This retrospective, serial cross-sectional, population-based study used registry data from the Surveillance, Epidemiology, and End Results (SEER 9) database. Average annual percentage change (AAPC) was calculated to describe cancer incidence trends from 1988 to 2018. Autoregressive integrating moving average (ARIMA) models were used to forecast the distribution of primary metastatic cancer and metastatic cancer to specific sites from 2019 to 2040 and JoinPoint models were fitted to estimate mean projected annual percentage change (APC). RESULTS: The average annual percent change (AAPC) in incidence of metastatic cancer decreased by 0.80 per 100,000 individuals (1988-2018) and we forecast an APC decrease by 0.70 per 100,000 individuals (2018-2040). Analyses predict a decrease in metastases to liver (APC = -3.40, 95% CI [-3.50, -3.30]), lung (APC (2019-2030) = -1.90, 95% CI [-2.90, -1.00]); (2030-2040) = -3.70, 95% CI [-4.60, -2.80]), bone (APC = -4.00, 95% CI [-4.30, -3.70]), and brain (APC = -2.30, 95% CI [-2.60, -2.00]). By 2040, patients with metastatic cancer are predicted to have 46.7% greater odds of long-term survivorship, driven by increasing plurality of patients with more indolent forms of metastatic disease. CONCLUSIONS: By 2040, the distribution of metastatic cancer patients is predicted to shift in predominance from invariably fatal to indolent cancers subtypes. Continued research on metastatic cancers is important to guide health policy and clinical intervention efforts, and direct allocations of healthcare resources.


Cancer that has spread beyond the area where it originated and into different organs is called metastatic cancer. This study analyzed trends in metastatic cancer incidence, the proportion of those with metastatic cancer surviving 5 years after diagnosis and the locations in the body each cancer had spread to. The incidence of metastatic cancer decreased between 1988 and 2018 and is expected to continue to decrease until 2040. Some of the most common locations cancer spreads to is the lung, liver, brain, and bone. Metastatic cancer incidence to these areas is predicted to decrease. Also, the likelihood of surviving for more than 5 years after diagnosis with metastatic cancer is predicted to increase by 2040. This research should facilitate optimal planning of future healthcare resources and policy.

3.
Nat Commun ; 14(1): 668, 2023 02 07.
Artigo em Inglês | MEDLINE | ID: mdl-36750564

RESUMO

Systemic lupus erythematosus is a heritable autoimmune disease that predominantly affects young women. To improve our understanding of genetic etiology, we conduct multi-ancestry and multi-trait meta-analysis of genome-wide association studies, encompassing 12 systemic lupus erythematosus cohorts from 3 different ancestries and 10 genetically correlated autoimmune diseases, and identify 16 novel loci. We also perform transcriptome-wide association studies, computational drug repurposing analysis, and cell type enrichment analysis. We discover putative drug classes, including a histone deacetylase inhibitor that could be repurposed to treat lupus. We also identify multiple cell types enriched with putative target genes, such as non-classical monocytes and B cells, which may be targeted for future therapeutics. Using this newly assembled result, we further construct polygenic risk score models and demonstrate that integrating polygenic risk score with clinical lab biomarkers improves the diagnostic accuracy of systemic lupus erythematosus using the Vanderbilt BioVU and Michigan Genomics Initiative biobanks.


Assuntos
Doenças Autoimunes , Lúpus Eritematoso Sistêmico , Humanos , Feminino , Estudo de Associação Genômica Ampla , Predisposição Genética para Doença , Fenótipo , Polimorfismo de Nucleotídeo Único
4.
Front Immunol ; 13: 889296, 2022.
Artigo em Inglês | MEDLINE | ID: mdl-35833142

RESUMO

Genome-wide association studies (GWAS) have identified hundreds of genetic variants associated with autoimmune diseases and provided unique mechanistic insights and informed novel treatments. These individual genetic variants on their own typically confer a small effect of disease risk with limited predictive power; however, when aggregated (e.g., via polygenic risk score method), they could provide meaningful risk predictions for a myriad of diseases. In this review, we describe the recent advances in GWAS for autoimmune diseases and the practical application of this knowledge to predict an individual's susceptibility/severity for autoimmune diseases such as systemic lupus erythematosus (SLE) via the polygenic risk score method. We provide an overview of methods for deriving different polygenic risk scores and discuss the strategies to integrate additional information from correlated traits and diverse ancestries. We further advocate for the need to integrate clinical features (e.g., anti-nuclear antibody status) with genetic profiling to better identify patients at high risk of disease susceptibility/severity even before clinical signs or symptoms develop. We conclude by discussing future challenges and opportunities of applying polygenic risk score methods in clinical care.


Assuntos
Doenças Autoimunes , Lúpus Eritematoso Sistêmico , Doenças Autoimunes/genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla/métodos , Humanos , Lúpus Eritematoso Sistêmico/diagnóstico , Lúpus Eritematoso Sistêmico/genética , Fatores de Risco
5.
J Mol Biol ; 434(15): 167693, 2022 08 15.
Artigo em Inglês | MEDLINE | ID: mdl-35777465

RESUMO

Human microbiome consists of trillions of microorganisms. Microbiota can modulate the host physiology through molecule and metabolite interactions. Integrating microbiome and metabolomics data have the potential to predict different diseases more accurately. Yet, most datasets only measure microbiome data but without paired metabolome data. Here, we propose a novel integrative modeling framework, Microbiome-based Supervised Contrastive Learning Framework (MB-SupCon). MB-SupCon integrates microbiome and metabolome data to generate microbiome embeddings, which can be used to improve the prediction accuracy in datasets that only measure microbiome data. As a proof of concept, we applied MB-SupCon on 720 samples with paired 16S microbiome data and metabolomics data from patients with type 2 diabetes. MB-SupCon outperformed existing prediction methods and achieved high average prediction accuracies for insulin resistance status (84.62%), sex (78.98%), and race (80.04%). Moreover, the microbiome embeddings form separable clusters for different covariate groups in the lower-dimensional space, which enhances data visualization. We also applied MB-SupCon on a large inflammatory bowel disease study and observed similar advantages. Thus, MB-SupCon could be broadly applicable to improve microbiome prediction models in multi-omics disease studies.


Assuntos
Metaboloma , Microbiota , Aprendizado de Máquina Supervisionado , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/microbiologia , Humanos , Doenças Inflamatórias Intestinais/genética , Doenças Inflamatórias Intestinais/microbiologia , Metabolômica/métodos , RNA Ribossômico 16S/genética
6.
Nat Commun ; 13(1): 3258, 2022 06 07.
Artigo em Inglês | MEDLINE | ID: mdl-35672318

RESUMO

Transcriptome-wide association studies (TWAS) are popular approaches to test for association between imputed gene expression levels and traits of interest. Here, we propose an integrative method PUMICE (Prediction Using Models Informed by Chromatin conformations and Epigenomics) to integrate 3D genomic and epigenomic data with expression quantitative trait loci (eQTL) to more accurately predict gene expressions. PUMICE helps define and prioritize regions that harbor cis-regulatory variants, which outperforms competing methods. We further describe an extension to our method PUMICE +, which jointly combines TWAS results from single- and multi-tissue models. Across 79 traits, PUMICE + identifies 22% more independent novel genes and increases median chi-square statistics values at known loci by 35% compared to the second-best method, as well as achieves the narrowest credible interval size. Lastly, we perform computational drug repurposing and confirm that PUMICE + outperforms other TWAS methods.


Assuntos
Estudo de Associação Genômica Ampla , Transcriptoma , Reposicionamento de Medicamentos , Epigenômica , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla/métodos , Genômica , Humanos , Polimorfismo de Nucleotídeo Único , Transcriptoma/genética
7.
JAMA Netw Open ; 5(1): e2145876, 2022 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-35099546

RESUMO

Importance: The results of studies evaluating spinal cord stimulation (SCS) for postlaminectomy syndrome (PLS) have yielded mixed results. This has led to an increased emphasis on objective outcome measures such as opioid prescribing. Objective: To determine the association between SCS and long-term opioid therapy (LOT) for PLS. Design, Setting, and Participants: In this cohort study, adults with PLS were identified using the TriNetx Diamond Network and separated based on whether they underwent SCS. Patients were stratified according to baseline opioid use (opioid-naive or receiving LOT) and subsequent opioid therapy over the 12-month period ranging from 3 to 15 months post-SCS implantation or post-PLS index date. Statistical analysis was performed from June to December 2021. Exposure: SCS. Main Outcomes and Measures: The main outcome was cessation of opioid use among patients receiving LOT or abstinence from opioids among opioid-naive patients. Opioid-naive patients were defined as those receiving at most 2 opioid prescriptions per year, and patients on LOT were those receiving at least 6 opioid prescriptions per year. Results: Among 552 937 eligible patients treated between December 2015 and May 2021, 26 179 with PLS received an SCS implant. The median (IQR) patient age was 60 (51-69) years; 305 802 patients (55.3%) were female. Among those reporting racial identify (37.0% [204 758 patients]), 9.3% (18 971 patients) were African American, 0.3% (648 patients) were Asian, and 90.4% (185 139 patients) were White. Compared with those who did not receive an SCS, individuals who received an SCS were more likely to be using opioids preimplantation (mean [SD] prescriptions: 4.3 [8.5] vs 4.1 [9.3]; P < .001) but less likely to be using opioids after SCS implantation (mean [SD] prescriptions: 3.8 [8.2] vs 4.0 [9.4]; P = .006). In the 12-month study period, similar proportions in the SCS and no-SCS groups receiving baseline LOT remained on LOT (70.3% [n = 74 585] vs 69.2% [n = 3882], respectively; P = .10). In opioid-naive patients, SCS was associated with a small decreased likelihood of patients subsequently receiving LOT (7.6% vs 7.0%; difference, -0.6% [95% CI, -1.0% to -0.2%]; P = .003). In multivariable analysis, SCS was associated with an increased likelihood of not being on opioids in both opioid-naive (adjusted odds ratio [OR], 0.90 [95% CI, 0.85-0.96]; P < .001) and LOT patients (adjusted OR, 0.93 [95% CI, 0.88-0.99]; P = .02). White patients were significantly more likely to be diagnosed with PLS (ie, underwent surgery) (90.4% vs 85.2%; difference, 5.2% [95% CI, 5.1%-5.4%]; P < .001) and receive an SCS (93.7% vs 90.3%; difference, 3.4% [95% CI, 2.9% to 4.0%]; P < .001) than patients of other racial identities. Conclusions and Relevance: These findings suggest that under real-life conditions, SCS was associated with small, clinically questionable associations with opioid discontinuation and not starting opioids in the context of PLS.


Assuntos
Analgésicos Opioides/uso terapêutico , Prescrições de Medicamentos/estatística & dados numéricos , Síndrome Pós-Laminectomia/terapia , Laminectomia/efeitos adversos , Padrões de Prática Médica/estatística & dados numéricos , Estimulação da Medula Espinal/estatística & dados numéricos , Idoso , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Análise Multivariada , Razão de Chances , Período Pós-Operatório , Implantação de Prótese
8.
Am J Hum Genet ; 109(1): 81-96, 2022 01 06.
Artigo em Inglês | MEDLINE | ID: mdl-34932938

RESUMO

Large-scale gene sequencing studies for complex traits have the potential to identify causal genes with therapeutic implications. We performed gene-based association testing of blood lipid levels with rare (minor allele frequency < 1%) predicted damaging coding variation by using sequence data from >170,000 individuals from multiple ancestries: 97,493 European, 30,025 South Asian, 16,507 African, 16,440 Hispanic/Latino, 10,420 East Asian, and 1,182 Samoan. We identified 35 genes associated with circulating lipid levels; some of these genes have not been previously associated with lipid levels when using rare coding variation from population-based samples. We prioritize 32 genes in array-based genome-wide association study (GWAS) loci based on aggregations of rare coding variants; three (EVI5, SH2B3, and PLIN1) had no prior association of rare coding variants with lipid levels. Most of our associated genes showed evidence of association among multiple ancestries. Finally, we observed an enrichment of gene-based associations for low-density lipoprotein cholesterol drug target genes and for genes closest to GWAS index single-nucleotide polymorphisms (SNPs). Our results demonstrate that gene-based associations can be beneficial for drug target development and provide evidence that the gene closest to the array-based GWAS index SNP is often the functional gene for blood lipid levels.


Assuntos
Exoma , Variação Genética , Estudo de Associação Genômica Ampla , Lipídeos/sangue , Fases de Leitura Aberta , Alelos , Glicemia/genética , Estudos de Casos e Controles , Biologia Computacional/métodos , Bases de Dados Genéticas , Diabetes Mellitus Tipo 2/genética , Diabetes Mellitus Tipo 2/metabolismo , Predisposição Genética para Doença , Genética Populacional , Estudo de Associação Genômica Ampla/métodos , Humanos , Metabolismo dos Lipídeos/genética , Fígado/metabolismo , Fígado/patologia , Anotação de Sequência Molecular , Herança Multifatorial , Fenótipo , Polimorfismo de Nucleotídeo Único
9.
Psychol Med ; 52(5): 968-978, 2022 04.
Artigo em Inglês | MEDLINE | ID: mdl-32762793

RESUMO

BACKGROUND: Substance use occurs at a high rate in persons with a psychiatric disorder. Genetically informative studies have the potential to elucidate the etiology of these phenomena. Recent developments in genome-wide association studies (GWAS) allow new avenues of investigation. METHOD: Using results of GWAS meta-analyses, we performed a factor analysis of the genetic correlation structure, a genome-wide search of shared loci, and causally informative tests for six substance use phenotypes (four smoking, one alcohol, and one cannabis use) and five psychiatric disorders (ADHD, anorexia, depression, bipolar disorder, and schizophrenia). RESULTS: Two correlated externalizing and internalizing/psychosis factor were found, although model fit was beneath conventional standards. Of 458 loci reported in previous univariate GWAS of substance use and psychiatric disorders, about 50% (230 loci) were pleiotropic with additional 111 pleiotropic loci not reported from past GWAS. Of the 341 pleiotropic loci, 152 were associated with both substance use and psychiatric disorders, implicating neurodevelopment, cell morphogenesis, biological adhesion pathways, and enrichment in 13 different brain tissues. Seventy-five and 114 pleiotropic loci were specific to either psychiatric disorders or substance use phenotypes, implicating neuronal signaling pathway and clathrin-binding functions/structures, respectively. No consistent evidence for phenotypic causation was found across different Mendelian randomization methods. CONCLUSIONS: Genetic etiology of substance use and psychiatric disorders is highly pleiotropic and involves shared neurodevelopmental path, neurotransmission, and intracellular trafficking. In aggregate, the patterns are not consistent with vertical pleiotropy, more likely reflecting horizontal pleiotropy or more complex forms of phenotypic causation.


Assuntos
Transtornos Mentais , Esquizofrenia , Transtornos Relacionados ao Uso de Substâncias , Pleiotropia Genética , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Transtornos Mentais/epidemiologia , Transtornos Mentais/genética , Fenótipo , Polimorfismo de Nucleotídeo Único , Esquizofrenia/epidemiologia , Esquizofrenia/genética , Transtornos Relacionados ao Uso de Substâncias/epidemiologia , Transtornos Relacionados ao Uso de Substâncias/genética
10.
JAMA Netw Open ; 4(10): e2127784, 2021 10 01.
Artigo em Inglês | MEDLINE | ID: mdl-34613403

RESUMO

Importance: Currently, there are limited published data regarding resource use and spending on cancer care in the US. Objective: To characterize the most frequent medical services provided and the associated spending for privately insured patients with cancer in the US. Design, Setting, and Participants: This cohort study used data from the MarketScan database for the calendar year 2018 from a sample of 27.1 million privately insured individuals, including patients with a diagnosis of the 15 most prevalent cancers, predominantly from large insurers and self-insured employers. Overall societal health care spending was estimated for each cancer type by multiplying the mean total spending per patient (estimated from MarketScan) by the number of privately insured patients living with that cancer in 2018, as reported by the National Cancer Institute's Surveillance, Epidemiology, and End Results program. Analyses were performed from February 1, 2018, to July 8, 2021. Exposures: Evaluation and management as prescribed by treating care team. Main Outcomes and Measures: Current Procedural Terminology and Healthcare Common Procedure Coding System codes based on cancer diagnosis code. Results: The estimated cost of cancer care in 2018 for 402 115 patients with the 15 most prevalent cancer types was approximately $156.2 billion for privately insured adults younger than 65 years in the US. There were a total of 38.4 million documented procedure codes for 15 cancers in the MarketScan database, totaling $10.8 billion. Patients with breast cancer contributed the greatest total number of services (10.9 million [28.4%]), followed by those with colorectal cancer (3.9 million [10.2%]) and prostate cancer (3.6 million [9.4%]). Pathology and laboratory tests contributed the highest number of services performed (11.7 million [30.5%]), followed by medical services (6.3 million [16.4%]) and medical supplies and nonphysician services (6.1 million [15.9%]). The costliest cancers were those of the breast ($3.4 billion [31.5%]), followed by lung ($1.1 billion [10.2%]) and colorectum ($1.1 billion [10.2%]). Medical supplies and nonphysician services contributed the highest total spent ($4.0 billion [37.0%]), followed by radiology ($2.1 billion [19.4%]) and surgery ($1.8 billion [16.7%]). Conclusions and Relevance: This analysis suggests that patients with breast, colorectal, and prostate cancers had the greatest number of services performed, particularly for pathology and laboratory tests, whereas patients with breast, lung, lymphoma, and colorectal cancer incurred the greatest costs, particularly for medical supplies and nonphysician services. The cost of cancer care in 2018 for the 15 most prevalent cancer types was estimated to be approximately $156.2 billion for privately insured adults younger than 65 years in the US.


Assuntos
Planos de Seguro com Fins Lucrativos/normas , Custos de Cuidados de Saúde/estatística & dados numéricos , Neoplasias/economia , Aceitação pelo Paciente de Cuidados de Saúde/estatística & dados numéricos , Adulto , Estudos de Coortes , Feminino , Planos de Seguro com Fins Lucrativos/estatística & dados numéricos , Humanos , Masculino , Pessoa de Meia-Idade , Neoplasias/epidemiologia , Estados Unidos/epidemiologia
11.
Nat Neurosci ; 24(10): 1367-1376, 2021 10.
Artigo em Inglês | MEDLINE | ID: mdl-34446935

RESUMO

Behaviors and disorders related to self-regulation, such as substance use, antisocial behavior and attention-deficit/hyperactivity disorder, are collectively referred to as externalizing and have shared genetic liability. We applied a multivariate approach that leverages genetic correlations among externalizing traits for genome-wide association analyses. By pooling data from ~1.5 million people, our approach is statistically more powerful than single-trait analyses and identifies more than 500 genetic loci. The loci were enriched for genes expressed in the brain and related to nervous system development. A polygenic score constructed from our results predicts a range of behavioral and medical outcomes that were not part of genome-wide analyses, including traits that until now lacked well-performing polygenic scores, such as opioid use disorder, suicide, HIV infections, criminal convictions and unemployment. Our findings are consistent with the idea that persistent difficulties in self-regulation can be conceptualized as a neurodevelopmental trait with complex and far-reaching social and health correlates.


Assuntos
Comportamento Aditivo/genética , Estudos de Associação Genética , Autocontrole , Transtorno do Deficit de Atenção com Hiperatividade/genética , Comportamento Aditivo/psicologia , Sintomas Comportamentais/genética , Sintomas Comportamentais/psicologia , Biologia Computacional , Crime/psicologia , Estudo de Associação Genômica Ampla , Infecções por HIV/genética , Infecções por HIV/psicologia , Humanos , Metanálise como Assunto , Herança Multifatorial , Análise Multivariada , Transtornos Relacionados ao Uso de Opioides/genética , Transtornos Relacionados ao Uso de Opioides/psicologia , Reprodutibilidade dos Testes , Suicídio , Desemprego
12.
Genome Res ; 31(9): 1629-1637, 2021 09.
Artigo em Inglês | MEDLINE | ID: mdl-34426515

RESUMO

The X Chromosome plays an important role in human development and disease. However, functional genomic and disease association studies of X genes greatly lag behind autosomal gene studies, in part owing to the unique biology of X-Chromosome inactivation (XCI). Because of XCI, most genes are only expressed from one allele. Yet, ∼30% of X genes "escape" XCI and are transcribed from both alleles, many only in a proportion of the population. Such interindividual differences are likely to be disease relevant, particularly for sex-biased disorders. To understand the functional biology for X-linked genes, we developed X-Chromosome inactivation for RNA-seq (XCIR), a novel approach to identify escape genes using bulk RNA-seq data. Our method, available as an R package, is more powerful than alternative approaches and is computationally efficient to handle large population-scale data sets. Using annotated XCI states, we examined the contribution of X-linked genes to the disease heritability in the United Kingdom Biobank data set. We show that escape and variable escape genes explain the largest proportion of X heritability, which is in large part attributable to X genes with Y homology. Finally, we investigated the role of each XCI state in sex-biased diseases and found that although XY homologous gene pairs have a larger overall effect size, enrichment for variable escape genes is significantly increased in female-biased diseases. Our results, for the first time, quantitate the importance of variable escape genes for the etiology of sex-biased disease, and our pipeline allows analysis of larger data sets for a broad range of phenotypes.


Assuntos
Genes Ligados ao Cromossomo X , Inativação do Cromossomo X , Alelos , Animais , Feminino , Genômica , Cromossomo X/genética
13.
G3 (Bethesda) ; 11(4)2021 04 15.
Artigo em Inglês | MEDLINE | ID: mdl-33713107

RESUMO

In microbiome research, metagenomic sequencing generates enormous amounts of data. These data are typically classified into taxa for taxonomy analysis, or into genes for functional analysis. However, a joint analysis where the reads are classified into taxa-specific genes is often overlooked. To enable the analysis of this biologically meaningful feature, we developed a novel bioinformatic toolkit, MetaPrism, which can analyze sequence reads for a set of joint taxa/gene analyses to: 1) classify sequence reads and estimate the abundances for taxa-specific genes; 2) tabularize and visualize taxa-specific gene abundances; 3) compare the abundances between groups; and 4) build prediction models for clinical outcome. We illustrated these functions using a published microbiome metagenomics dataset from patients treated with immune checkpoint inhibitor therapy and showed the joint features can serve as potential biomarkers to predict therapeutic responses. MetaPrism is a toolkit for joint taxa and gene analysis. It offers biological insights on the taxa-specific genes on top of the taxa-alone or gene-alone analysis. MetaPrism is open-source software and freely available at https://github.com/jiwoongbio/MetaPrism. The example script to reproduce the manuscript is also provided in the above code repository.


Assuntos
Metagenômica , Microbiota , Algoritmos , Biologia Computacional , Humanos , Metagenoma , Microbiota/genética , Análise de Sequência de DNA , Software
15.
Nat Commun ; 12(1): 1964, 2021 03 30.
Artigo em Inglês | MEDLINE | ID: mdl-33785739

RESUMO

Genome-wide association meta-analysis (GWAMA) is an effective approach to enlarge sample sizes and empower the discovery of novel associations between genotype and phenotype. Independent replication has been used as a gold-standard for validating genetic associations. However, as current GWAMA often seeks to aggregate all available datasets, it becomes impossible to find a large enough independent dataset to replicate new discoveries. Here we introduce a method, MAMBA (Meta-Analysis Model-based Assessment of replicability), for assessing the "posterior-probability-of-replicability" for identified associations by leveraging the strength and consistency of association signals between contributing studies. We demonstrate using simulations that MAMBA is more powerful and robust than existing methods, and produces more accurate genetic effects estimates. We apply MAMBA to a large-scale meta-analysis of addiction phenotypes with 1.2 million individuals. In addition to accurately identifying replicable common variant associations, MAMBA also pinpoints novel replicable rare variant associations from imputation-based GWAMA and hence greatly expands the set of analyzable variants.


Assuntos
Algoritmos , Biologia Computacional/métodos , Estudo de Associação Genômica Ampla/métodos , Metanálise como Assunto , Modelos Genéticos , Polimorfismo de Nucleotídeo Único , Estudos de Associação Genética/métodos , Genótipo , Fenótipo , Reprodutibilidade dos Testes , Tamanho da Amostra , Software
16.
Gigascience ; 10(2)2021 02 05.
Artigo em Inglês | MEDLINE | ID: mdl-33543271

RESUMO

BACKGROUND: Trillions of microbes inhabit the human body and have a profound effect on human health. The recent development of metagenome-wide association studies and other quantitative analysis methods accelerate the discovery of the associations between human microbiome and diseases. To assess the strengths and limitations of these analytical tools, simulating realistic microbiome datasets is critically important. However, simulating the real microbiome data is challenging because it is difficult to model their correlation structure using explicit statistical models. RESULTS: To address the challenge of simulating realistic microbiome data, we designed a novel simulation framework termed MB-GAN, by using a generative adversarial network (GAN) and utilizing methodology advancements from the deep learning community. MB-GAN can automatically learn from given microbial abundances and compute simulated abundances that are indistinguishable from them. In practice, MB-GAN showed the following advantages. First, MB-GAN avoids explicit statistical modeling assumptions, and it only requires real datasets as inputs. Second, unlike the traditional GANs, MB-GAN is easily applicable and can converge efficiently. CONCLUSIONS: By applying MB-GAN to a case-control gut microbiome study of 396 samples, we demonstrated that the simulated data and the original data had similar first-order and second-order properties, including sparsity, diversities, and taxa-taxa correlations. These advantages are suitable for further microbiome methodology development where high-fidelity microbiome data are needed.


Assuntos
Microbiota , Redes Neurais de Computação , Simulação por Computador , Humanos , Processamento de Imagem Assistida por Computador , Proteínas
17.
J Invest Dermatol ; 141(6): 1493-1502, 2021 06.
Artigo em Inglês | MEDLINE | ID: mdl-33385400

RESUMO

Psoriasis and type 2 diabetes (T2D) are complex conditions with significant impacts on health. Patients with psoriasis have a higher risk of T2D (∼1.5 OR) and vice versa, controlling for body mass index; yet, there has been a limited study comparing their genetic architecture. We hypothesized that there are shared genetic components between psoriasis and T2D. Trans-disease meta-analysis was applied to 8,016,731 well-imputed genetic markers from large-scale meta-analyses of psoriasis (11,024 cases and 16,336 controls) and T2D (74,124 cases and 824,006 controls), adjusted for body mass index. We confirmed our findings in a hospital-based study (42,112 patients) and tested for causal relationships with multivariable Mendelian randomization. Mendelian randomization identified a causal relationship between psoriasis and T2D (P = 1.6 × 10‒4, OR = 1.01) and highlighted the impact of body mass index. Trans-disease meta-analysis further revealed four genome-wide significant loci (P < 5 × 10‒8) with evidence of colocalization and shared directions of effect between psoriasis and T2D not present in body mass index. The proteins coded by genes in these loci (ACTR2, ERLIN1, TRMT112, and BECN1) are connected through NF-κB signaling. Our results provide insight into the immunological components that connect immune-mediated skin conditions and metabolic diseases, independent of confounding factors.


Assuntos
Diabetes Mellitus Tipo 2/genética , Loci Gênicos/imunologia , Psoríase/genética , Índice de Massa Corporal , Causalidade , Diabetes Mellitus Tipo 2/epidemiologia , Diabetes Mellitus Tipo 2/imunologia , Predisposição Genética para Doença/epidemiologia , Estudo de Associação Genômica Ampla , Humanos , Análise da Randomização Mendeliana , NF-kappa B/metabolismo , Polimorfismo de Nucleotídeo Único , Psoríase/epidemiologia , Psoríase/imunologia , Transdução de Sinais/genética , Transdução de Sinais/imunologia
18.
Nat Genet ; 52(12): 1314-1332, 2020 12.
Artigo em Inglês | MEDLINE | ID: mdl-33230300

RESUMO

Genetic studies of blood pressure (BP) to date have mainly analyzed common variants (minor allele frequency > 0.05). In a meta-analysis of up to ~1.3 million participants, we discovered 106 new BP-associated genomic regions and 87 rare (minor allele frequency ≤ 0.01) variant BP associations (P < 5 × 10-8), of which 32 were in new BP-associated loci and 55 were independent BP-associated single-nucleotide variants within known BP-associated regions. Average effects of rare variants (44% coding) were ~8 times larger than common variant effects and indicate potential candidate causal genes at new and known loci (for example, GATA5 and PLCB3). BP-associated variants (including rare and common) were enriched in regions of active chromatin in fetal tissues, potentially linking fetal development with BP regulation in later life. Multivariable Mendelian randomization suggested possible inverse effects of elevated systolic and diastolic BP on large artery stroke. Our study demonstrates the utility of rare-variant analyses for identifying candidate genes and the results highlight potential therapeutic targets.


Assuntos
Pressão Sanguínea/genética , Frequência do Gene/genética , Predisposição Genética para Doença/genética , Hipertensão/genética , Fator de Transcrição GATA5/genética , Estudo de Associação Genômica Ampla , Genótipo , Humanos , Mutação/genética , Fosfolipase C beta/genética , Polimorfismo de Nucleotídeo Único/genética
19.
Bioinformatics ; 36(19): 4951-4954, 2020 12 08.
Artigo em Inglês | MEDLINE | ID: mdl-32756942

RESUMO

SUMMARY: Here, we present a highly efficient R-package seqminer2 for querying and retrieving sequence variants from biobank scale datasets of millions of individuals and hundreds of millions of genetic variants. Seqminer2 implements a novel variant-based index for querying VCF/BCF files. It improves the speed of query and retrieval by several magnitudes compared to the state-of-the-art tools based upon tabix. It also reimplements support for BGEN and PLINK format, which improves speed over alternative implementations. The improved efficiency and comprehensive support for popular file formats will facilitate method development, software prototyping and data analysis of biobank scale sequence datasets in R. AVAILABILITY AND IMPLEMENTATION: The seqminer2 R package is available from https://github.com/zhanxw/seqminer. Scripts used for the benchmarks are available in https://github.com/yang-lina/seqminer/blob/master/seqminer2%20benchmark%20script.txt. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Bancos de Espécimes Biológicos , Software , Genótipo , Humanos
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA
...